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Abstract 

We analyze signal detection with nonlinear test statistics in the presence of colored noise. In the limits of small signal and 
weak noise correlation, the optimal test statistic and its performance are derived under general conditions, especially 
concerning the type of noise. We also analyze, for a threshold nonlinearity-a key component of a neural model, the 
conditions for noise-enhanced performance, establishing that colored noise is superior to white noise for detection. For a 
parallel array of nonlinear elements, approximating neurons, we demonstrate even broader conditions allowing noise- 
enhanced detection, via a form of suprathreshold stochastic resonance. 
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Introduction 

Stochastic resonance has emerged as a significant statistical 
phenomenon where the presence of noise is beneficial for signal 
and information processing in both man-made and natural 
systems [1-11]. The excitable FitzHugh-Nagumo (FHN) neuron 
model has been discussed for exploring the functional role of noise 
in neural coding of sensory information [12]. Following this, the 
milestone concept of aperiodic stochastic resonance using the 
FHN neuron model [13] stimulated a number of interesting 
investigations in sensory biology [3,7,14,15] and physiological 
experiments [6,8,16-20]. Due to the character of activity in the 
nervous system, the neuron coding strategy based on stochastic 
resonance is also found in threshold (level-crossing) [21-23] and 
threshold-free [24—28] neurons. Since there are large numbers of 
neurons in the nervous system of animals and humans with 
variations in structure, function and size [2,4,7-9], then the 
potential exploitation of stochastic resonance in a neuron bundle 
becomes an interesting open question in neuroscience. In a general 
summing neural network, Collins et al. [4] reported that the noise 
intrinsic to each neuron could be used to extend the operating 
range of the sensitivity of the overall system. This, however, is not 
a unique case. In the summing multi-threshold network, supra- 
threshold stochastic resonance discovered by Stocks [29] over- 
comes the restriction of subthreshold signals, and appears to offer a 
possible explantation of dc adaptation in sensory neurons [9,30]. 
One-dimensional coupling [31] and spatio-temporal stochastic 
resonance [7,32] show that not only an optimal noise intensity but 
also an optimal coupling strength exists. Recent stochastic 
resonance research in complex networks [3 3 — 4-0] also demon- 
strates that an interconnected network configuration, as well as the 
non-zero noise level, can be optimized to achieve the best system 
performance. 



In many practical situations, the idealization of white noise is never 
exactly realized [2,5]. Consequently, the effect of colored noise on 
stochastic resonance has been investigated using the measure of 
output signal-to-noise ratio of a periodic signal [2,5,16,41^43]. 
Although the suppression of stochastic resonance with increasing 
noise correlation time was demonstrated [2,5,4 1-43] , it is interesting 
to note that, under certain circumstances, colored noise can be 
superior to white noise for enhancing the response of a nonlinear 
system to a weak signal [16,44]. In the field of signal detection, the 
employment of noise to enhance signal detectability also becomes a 
possible option [45-55]. However, in most of these studies, the 
observed noise samples are often assumed to be independent. 
Colored noise for signal detection [56-60] is not adequately 
investigated in the context of stochastic resonance. In this article, 
we focus on the weak signal detection problem with the beneficial role 
of additive colored noise in threshold neurons. Because of the "all- 
and-none" character of nerve activity [61], the problem of threshold- 
based neural signal detection can be considered as a statistical binary 
hypothesis test [7,23,27] . In this situation, explicit expressions for the 
maximum asymptotic detection efficacy are derived for a given 
transfer function of neuron model. We prove that colored noise that 
arises from a moving-average model is superior to white noise in 
improving the detection efficacy of neurons. It is illustratively shown 
that, for a single neuron with a signum threshold nonlinearity, the 
possibility of noise-enhanced detection only holds for non-scaled 
noise. For scaled noise, the effect of noise-enhanced detection does 
not occur in a single neuron model. However, when we tune the 
internal noise components of a parallel array of threshold neurons, it 
is observed that the constructive role of noise comes into play again in 
improving the signal detection efficacy, wherein suprathreshold 
stochastic resonance manifests its potentiality. 
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Results 



where the Cauchy-Schwarz inequality yields 



Detection model 

Consider the detection problem formulated as a binary 
hypothesis test [23,60,62] 



H 0 : x = z, 
Hi : x = z 4 



(1) 



with the Fisher information matrix J = E [( - '"{^ )( - ) T ] ■ Note 
that the equality of Eq. (6) is satisfied by the locally optimum 
nonlinearity 



Under hypothesis Ho, the observation vector x= [xi,x 2 , ■ ■ ■ ,Xff] 
consists of noise z= [zi,z 2 , • • • ,z/v] T only, and under hypothesis Hi 
it consists of noise z and known signal s= [s\,s 2 , ■ ■ ■ ,s^] T with its 
strength 6. There exists a finite bound U s such that 0< \s„\ < U s , 
and the asymptotic average signal power satisfies 
0<P, = liniAf^oo s T s/N< oo [56-58,60,62]. Next, the test statis- 
tic 7X X ) is compared with a decision threshold y to decide the 
hypotheses, as 



T(x) = c T g(x)^y, 



(2) 



where the coefficient vector c=[ci,c 2 , ■ ■ ■ ,Cjy] is associated with 
the function g(x) to form 7"(x). 

Assume the TV-dimensional probability distribution /(z) of noise 
z and zero-mean vector of E[g(z)] = [ g(z)f(z)dz (for a shift in 
mean) [58,60]. Then, for a large sample size N of observation 
vector x, the test statistic T(x) has zero-mean and asymptotic 
variance 



var(r(x)|// 0 ) = c r E[£(z)g(z) r ]c, 



(3) 



under hypothesis Ho. Furthermore, for weak signals (6 — >0) and 
under hypothesis Hi , g(x) can be expanded to the first-order 



g(x)=g(z + 8s) X g(z) + 8^ s. 

oz 



(4) 



Then, the characteristics of 7"(x) under H\ , up to the first-order in 
9, can be calculated as 



E(r(x)|// 1 )«(?c r E[ 5 ^ ) ] S , vai<r(x)|#i)«vai(r(x)|flb)- (5) 

Under both hypotheses Ho and Hi, the test statistic T(x), 
according to the central limit theorem, converges to a Gaussian 
distribution. Thus, the binary hypothesis test of Eq. (1) becomes a 
Gaussian mean-shift detection problem [60,62]. Given the false 
probability, the detection probability is a monotonically increasing 
function of the detection efficacy £(7") [60,62] given by 



i(T)= lim (aE[r ( x)| gl] / 3fl | r ) 

v ' n^oo vai[T(x)\H 0 ] 



(c r E[5g(z)/gz] S ) 2 
c r E^(z)gr (z )] c 

(c r E[g(z)(51n/(z)/5z) r ]s)' 



(6) 



c T E[g(z)g(z) J ]c 



< s T J s, 



g opt (z)=C 



51n/(z) 



(8) 



for an arbitrary constant C. 

However, a complete closed-form noise distribution /(z) may be 
unavailable, especially in unknown noisy circumstances [56— 
58,60,62], which makes the nonlinearity of Eq. (8) difficult or too 
complex to implement. Thus, there may be compelling reasons for 
considering the given function g with an easily implemented 
feature. In this case, the detection efficacy in Eq. (6) can be 
maximized as 



«2> 



(c T E[dg(z)/dz] S y 
c r E[g(z)^(z)]c 

[(L r c) r (L- 1 E[%(z)/5z]s)] / 



(9) 



(L-c) J (L^c) 



\3g(z)~ 


T 

V _1 E 


'Sg(i) 


. 5z . 




. 5z . 



<s r E 



with the Cholesky decomposition of the variance matrix 
V = E[g-(z) ( s*(z) r ] =LL r and by optimally choosing the coefficient 
vector c opt = K E[g(z)g(z) T ]~ X E[dg(z) / dz]s for an arbitrary con- 
stant K. 

Colored noise 

Consider a useful colored noise model of the first-order moving- 
average [56,59] as 



Zi= PiWi-l+Wi + p 2 w i+ i, 



(10) 



where the correlation coefficients are p\ 2 and 
W= [wi,W2, • • • ,Wn] T is an independent identically distributed 
(i.i.d.) random vector. For small values of p xl l \Pn\ < ^^), the 
dependence among noise samples z, will be weak [56,59]. Here, 
we assume M>, have an univariate distribution f w (w) that is 
symmetric about the origin. We also assume the memoryless 
nonlinearity g(z) = \g(zi),g{z2), • • • ,g{z^)] T to be odd symmetric 
about the origin. Then, up to first order in small correlation 
coefficients p\ j2 , we can expand g"(z,) as 



g(z,) mg(Wi) + (pi 1 + p 2 w i+ i)g'(w i ), 



ill) 



g'(z i )xg'(w i ) + (PiWj-i+p 2 Wi + i)g"(w i ), (12) 
and obtain expectations 



E[g'(z,)]*E[g'(vv)], 



(13) 
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E[f(z,-)]«E[^(wO 



(14) 



E[g( Zi )sfe +,)] *(p, +p 2 )E[wg(w)]E[g»}- (15) 
Therefore, we have the expectation matrix 



Since, from its definition in Eq. (6), the efficacy e(g,p) is non- 
negative, the denominator in Eq. (21) must satisfy 



E[g 2 (n-)] -2|p, +p 2 |E[vvg(,r)]E[g'(u<)] >0. 



(22) 



In order to validate Eq. (22), we use the Cauchy-Schwarz 
inequality to yield 



dg(zj 



(16) 



with the unit matrix I, and the variance matrix V = E[g(z)g"(z) r ] 
has elements 



V w = E[g 2 (w)], 



(17) 



E 2 [#1 <E[ W 2 ]E[g 2 (iv)] =^E[g 2 0v)], 



E 2 [g'(vv)] <E[(^? ) 2 ]E[g 2 (w)] =/(/|„)E[g 2 ( 1V )], (24) 

with the Fisher information quantity J(fw) = E[(fw/fw) ] an d the 
variance <7 2 of noise distribution of f w [56] . Thus, we find 



(23) 



V y+ ,=V j+M = (Pi +P 2 )E[wg(w)]E[/(v 



(18) 



for != 1,2, • • • ,N. Then, based on Eq. 
detection efficacy e(g) can be computed as 



the normalized 



e(g,p)-- 



j(T) 

s T E[8g(z)/8z] T \- l E[dg(z)/dz]$ 



(19) 



= E 2 [g'( W ) 
. E 2 fe'(u-) 



Here, when the equality of Eq. (19) is achieved, the signal s is the 
corresponding eigenvector to the minimum eigenvalue A m i n of the 
matrix V. It is known that the eigenvalues of the matrix V are [62] 



A/ = Vy + 2Vj |I - + iCOS 



7Y+1 



= E[g 2 (w)} +20, +p 2 )E[wg(w)]E[g'(w)} cos 



N+l 



(20) 



corresponding to eigenvectors v, = sin (nik/( N + 1 )) for 
k= 1,2, • ■ ■ ,N. Here, as the nonlinearity g is assumed to be odd, 
it is then found that E[u'g(u')] >0 and E[g'(u')] >0. Therefore, if 
p l +p 2 <0 and for a large sample size N, we take i=\ and 
A min = A i xE[g 2 (\v)}+2(p l +p 2 )E[wg(w)}E[g'(w)\. Otherwise, we 
choose X min = X N »E[g 2 (w)}-2(p 1 +p 2 )E[wg(w)}E[g'(w)\. An il- 
lustration of the eigenvector Vjy is shown in Fig. 1 for N = 200. In 
this way, by optimally choosing the input signal (eigenvector) s = Vi 
(Vjv), the maximum efficacy e(g,p) can be calculated as 



<?m a x(g,p)= max e(g,p) 

S 

E 2 [g'(u<)] 



V«-2|Vy+i| 

E[g\w)}-2\ Pl +p 2 \E[wg{w)] E[g'(w) 



(21) 



E[wg(w)] E[g'(w)] < yj<rlJ(f v ) E[g 2 {w)\ 
Substituting Eq. (25) into Eq. (22) and noting 

a 2 v J(f n ) = E[,r 2 ] e[(^ f\ > e[w&\ = 1 , 

Jw Jw 

we have 



1 1 



(25) 



(26) 



(27) 



Since we assume | /?i 2I <K 1 , the inequalities of Eqs. (27) and (22) 
can be satisfied, and the detector efficacy in Eq. (21) will be 
theoretically analyzed in the following. 

For white noise vector z with zero correlation coefficients 
Pi 2 = 0, the detection efficacy e{g,p) in Eq. (21) satisfies 



E 2 [g'Qv)] 



(28) 




200 



Figure 1. Eigenvector Vjy. An illustration of the eigenvector of the 

variance matrix V (JV = 200). 

doi:1 0.1 371/journal.pone.0091 345.g001 
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Thus, for a given function g, colored noise is superior to white 
noise in enhancing the detection efficacy, at a cost of optimally 
matching the input signal with the eigenvector v,- of covariance 
matrix V. 

Stochastic resonance in threshold-based neurons 

We will illustratively show the possibilities of noise-enhanced 
detection in threshold-based neurons. The classical McCulloch- 
Pitts threshold neuron has the form 



*<*) = 



l,x>£, 
0,x<£, 



(29) 



with the response threshold £. It is seen that g can be expressed as 
a function of x in terms of the signum (sign) function as 
g(x) = j sign(x — £) + j. Since the constant factor 1/2 does not 
affect the detection efficacy of the transfer function g, then we 
focus on the signum function 



g(x) = sign(x), 



(30) 



with response threshold £ = 0 in the following parts. Here, the 
signum function g is not continuous at x = 0, but has its derivative 
g'(x) = 23(x) for any x [60]. 

For the colored noise model of Eq. (10), the correlation 
coefficient p 2 = 0 indicates the noise sequence z, is a causal process 
that can be physically realized. Here, we assume P\=p (|p|«l) 
and p 2 = 0, and show the possibility of stochastic resonance in the 
physically realizable noise environment. First, consider scaled 
noise w(t) = G w Wo(t) that has the distribution f w ( w ) = 
fw„( w / a w)/ ff w [23,60]. Here, !4>o(0 has a standardized distribution 
f Wo with unity variance o 2 m = \. Thus, based on Eq. (21), the 
absolute moment is 

E[wg(»] =E[w sign(vv)] =E[|vv|] =ct„,E m , 0 [|wo|], (31) 

where the operator E W() [-]= J ■f WQ (wo)dwo- Thus, for the signum 
function g, the detection efficacy of Eq. (21) can be expressed as 



<?mnx(g",P) = 



4/ B 2 (0) 



4*,; 2 /4(0) 



1 -4|p|£(0)E[M] 1 -4|p[^ 0 (0)E u , 0 [K|] 



(32) 



It is seen in Eq. (32) that e m&x (g,p) is a monotonically decreasing 
function of noise variance a 2 v , and no noise-enhanced detection 
effect will occur in such a single neuron model for scaled noise. 

We further consider non-scaled Gaussian mixture distribution 
[5,47,48,60] 



fw(x)-- 



exp(- 



-jx-p) 1 
2e 2 



)+ exp(- 



2e 2 



where the variance c 2 = p 2 + e 2 and parameters p,e > 0. Then, for 
the signum function g in Eq. (30), the detection efficacy of Eq. (21) 
can be computed as 



-%exp(-^) , 
l-2\p\ [|exp(- ^) + ^exp(- ^)erf(^)] 

where the error function erf (x) = 2/ ^/n [* exp( — t 2 )dt. In Fig. 2, 



for the correlation coefficient |p|=0.2 and different values of 
p = 0.3,0.5 and 1, we show the detection efficacy of Eq. (34) as a 
function of noise variance a 2 v . For a given non-zero value \i and as 
e— >0, the noise distribution model of Eq. (33) indicates the 
dichotomous noise [5,48,53]. In this situation, as the signal 
strength 0— >0 and 1 9s„ | « p, the signum function g will not change 
its output whether the signal appears or not. Therefore, the test 
statistics T(x) = c T g(x) in Eq. (2) will be the same value under 
hypotheses Hq and Hi, and the detection efficacy e max (g,p) in Eq. 
(34) starts from zero. This explantation can be also validated by 
Eq. (34) as e->0 and fi being fixed, as illustrated in Fig. 2. 
However, it is clearly seen that, upon increasing the noise variance 
a 2 , =e 2 + fL 2 (actually increasing e), the noise-enhanced detection 
effect appears. The smaller the parameter ji is, the more 
pronounced the resonant peak of e may .(g,p) becomes, as shown 
in Fig. 2. 

Next, an interesting problem is that, for scaled noise, can we 
observe the noise-enhanced detection effect in threshold-based 
neurons? The answer to this question is affirmative. Here, we will 
resort to the constructive role of internal noise for improving the 
performance of an array of threshold neurons. Let 

x m = [xi im ,X2,m> ' ' ' > x N,m] T be the vector of N observation 
components at the m-th element of receiving array of M identical 
neurons. In this observation model, x,; m =z, A-yi, m under the 
hypothesis Hq. Here, in each neuron element, the M noise terms 
y m are assumed to be mutually independent with the same PDF /J, 
and variance a 2 Then, at the observed time i, the array outputs 
are collected as g(x,)= Ylm=l 8( x i,m)/M, and the test statistics 
can be reconstructed as T cl c(x) = c :r f(x) with 

g(x) = [g(x\),g(x2), - ■ ■ ,g(xN)] T ■ For the colored noise model of 
Eq. (10) with p l = p and p 2 = 0, we have 



g'{x i )Kg'{v i ) + pw i -ig"(Vi), 



(35) 



(36) 



where the composite noise v, = vv, +^', has the convolved 
distribution f v (v)= \f w (y — u)f y (u)du. Then, we have expectations 



1 . 

Ev[f'(x,)] = T7 E EvferUx)] *E v [g'(v)], (37) 



and 



8x 



<E v [g'(v)] I, 



(38) 



(33) wl 



with the operator E v [-]= [ -f v (v)dv. The variance matrix 
V = E v [j-(x)£(x) :r ] has elements 



V,,,=E v [f 2 (x,) 



T72 E E E y[8(xi,m)g(xiJ]\ 



1 

M 



- M 2 

m = i /j — i 

E,fe 2 (v)] + ( -^^E w {El[g(w+y)}}, 



(39) 
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} 0 1 2 3 4 5 6 

noise variance a 

w 



Figure 2. Stochastic resonance of a single threshold neuron. 

Detection efficacy of e m!lx (g,p) as a function of noise variance 
a\, = jr + e 2 for the correlation coefficient \p\ = 0.2 and different values 
of ^( = 0.3 (red), 0.5 (blue) and 1 (green). The resonant peaks of e max (g,p) 
are marked by the square (■), the star (*) and the down triangle (T) for 
^ = 0.3,0.5 and 1, respectively. Here, the transfer function g(x) = sign(x), 
and the noise distribution is Gaussian mixture model of Eq. (33). 
doi:1 0.1 371 /journal.pone.0091 345.g002 



V/./+ 1 = V,- + M = E v [g(xj)g(x i+ i)] 

{I M M -J 
T72 Yl Yl E y[s(Xi,m)g(Xi+l^)]\ 

w=ln=l (40) 

= j^E w {M%[g(xMx i+ i)]} 

* P E lv {n-E y [g(w+y)]}E v [g'(v)]. 

Then, based on Eqs. (38), (39) and (40), the maximum efficacy 
fmaxC?>p) can be computed by Eq. (21) as 

Wfep) = max e(g,p) = A ^ )l . (41) 
V u -2\V u+ i\ 

For instance, we assume the initial Gaussian noise components 
Wj have the distribution of f w (x) = , 1 exp ( — A ) and the 

given variance cr 2 . The internal noise components of each neuron 
is assumed to be the uniform random variable _y, with its 
distribution fy(x) = 1 / (2b) for —b<x<b and zero otherwise. The 
composite random variables V; are distributed by 




0 2 4 6 8 

uniform noise level b 




0 2 4 6 8 

uniform noise level b 



Figure 3. Suprathreshold stochastic resonance in an array of 
threshold neurons. Detection efficacy e mllx (g,p) as a function of the 
internal uniform noise level b and the neuron array size M. From the 
bottom upwards, M = 1,2,5, 1 0, 100,1 000, co. Here, the initial Gaussian 
noise level a w = 0.3, the transfer function g(x) = sign(.\'), the correlation 
coefficient (a) \p\ =0.2 and (b) \p\ =0.3. 
doi:10.1 371/journal.pone.0091 345.g003 

M and correlation coefficients p. It is noted that, at the uniform 
noise level b = 0, the detection efficacy in Eq. (41) is just the 
expression of Eq. (32) for a single neuron. Thus, e m:ix (g,p) = 9.49 
and 11.45 for \p\ =0.2 (see Fig. 3 (a)) and \p\ =0.3 (see Fig. 3 (b)), 
respectively. By comparing Fig. 3 (a) with Fig. 3 (b), it is seen that 
the maximum detection efficacy e m . AX (g,p) can be further 
enhanced for a higher value of p. For the array size M = 1 and 
upon increasing uniform noise level b, it is seen in Fig. 3 that there 
is no noise-enhanced effect in a single neuron. However, as M>2, 
it is illustrated in Fig. 3 that the internal uniform noise can 
enhance the detection efficacy e m ax(g",p), and the noise-enhanced 
effect does occur. Moreover, as the array size M increases, the 
noise-induced enhancement becomes more visible by adopting an 
appropriate amount of uniform noise of the neuron array, as 
shown in Fig. 3. As the detection problem so far is confined to the 
weak signal with its strength 9— >0 but 9>0, and the response 
threshold £ of all neurons is zero, thus Fig. 3 shows the potential 
capability of suprathreshold stochastic resonance in improving the 
detection efficacy of a parallel array threshold-based neurons. 

Methods 

Under the assumption of weak signals, the Taylor expansion of 
the function is utilized in Eqs. (4), (5), (1 1) and (12). The Cauchy- 
Schwarz inequality is used in Eqs. (7), (9), (23), and (24). The 
maximum of Rayleigh quotients for a symmetric matrix is 
calculated in Eqs. (19), (21) and (41). 



erf(^)-erf(^ 



For a given Gaussian noise level a w = 0.3, it is shown in Figs. 3 (a) 
and (b) that the maximum detection efficacy e mSLX (g,p) varies as a 
function of internal uniform noise level b for different array sizes 



Conclusion 

In this paper, we study the performance enhancement of 
threshold-based neurons for detecting weak signals in the presence 
of colored noise. For a given transfer function, we maximize the 
detection efficacy by optimally choosing the signal waveform. We 
prove that colored noise is superior to white noise in enhancing the 
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detection efficacy, at a cost of optimally matching the input signal 
with the eigenvector of the covariance matrix. Furthermore, we 
illustrate that, for a single threshold neuron, the possibility of 
noise-enhanced detection cannot occur in scaled noise, but does 
appear in a non-scaled Gaussian mixture noise model. Further- 
more, for scaled noise, we can test a parallel bundle of neurons 
with the same response threshold, and recover the positive role of 
internal noise in enhancing the detection efficacy of the neuron 
array via the mechanism of suprathreshold stochastic resonance. 
These results demonstrate that the strategy of exploiting stochastic 
resonance is still interesting in the case of improving the nonlinear 
system performance by adding more noise to the signal corrupted 
by colored noise. 

Here, we mainly consider the first-order moving-average noise 
model of Eq. (10) which is, as we show, amenable to analytical 
treatment. It is possible to extend the present approach to higher- 
order moving-average noise models. However, the same analytical 
treatment maybe no longer feasible. It is also interesting to 
consider yet other models of colored noise to enhance the 
detectability of the neuron array. This subject is very promising 
and currently under study. 

It is noted that the detection efficacy of Eqs. (6) and (9) are 
established under the assumption of weak signal strength 8 — >0. We 
only consider the first-order Taylor expansion of nonlinearities in 
Eq. (4), because it makes an analytical treatment possible and the 
corresponding results are rigorous. In practice, most noise 
distributions are symmetric and the nonlinear characteristics are 
odd symmetric about the origin. In this case, we can expand the 
nonlinearity to the second-order terms. The expectation of the 
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